Serial No. 09/918,733 

Amend, dated 01 August 2008 

Reply to Office Action of 02 April 2008 

Listing of Ciaims: 

This listing of claims will replace all prior versions and listings of claims in the subject 
application. 

Claims: 

1 . (Currently Amended) A speech recognition system comprising: 

a querying device for posing at least one query to a respondent over a telephone; 

a speech recognition device which that is configured and arranged to receive receives an 
audio response from said respondent over the telephone and to conduct conducts a speech 
recognition analysis of said audio response to automatically produce a corresponding text response; 

a storage device for recording and storing said audio response as it is received by said speech 
recognition device; 

an accuracy determination device for automatically comparing said text response to a text set 
of expected responses and determining whether said text response corresponds to one of said 
expected responses, wherein said accuracy determination device is configured and arranged to 
determine whether said text response corresponds to one of said expected responses within a 
predetermined accuracy confidence parameter and to flag said audio response so as to produce a 
flagged audio response for further review by a human operator when said text response does not 
correspond to one of said expected responses within said predetermined accuracy confidence 
parameter; and 

a human interface device for enabling said human operator to hear said flagged audio 
response and review the corresponding text response for the flagged audio response to determine the 
actual text response for the flagged audio response, either by selecting from a pre-determined list of 
text responses or typing the actual text response if no such match exists in the pre-determined list of 
text responses. 

2.-4. (Cancelled) 



BST99 1 575892-1 057622 0045 



-2- 



Serial No. 09/918,733 

Amend, dated 01 August 2008 

Reply to Office Action of 02 April 2008 

5 . (Previously presented) The speech recognition system of claim 1 , wherein said human 
interface device comprises a personal computer including a monitor for enabling the human operator 
to view said text responses and an audio speaker device for enabling the operator to listen to said 
flagged audio responses. 

6. (Previously presented) The speech recognition system of claim 5, wherein said 
querying device includes a program having an application file, said application file including code 
which causes the at least one query to be posed to the respondent, a list of expected responses and an 
address at which a file containing the received audio response will be stored in the storage device. 

7. (Previously presented) The speech recognition system of claim 1, wherein said 
querying device includes a program having an application file, said application file including code 
which causes the at least one query to be posed to the respondent, a list of expected responses and an 
address at which a file containing the received audio response will be stored in the storage device. 

8 . (Previously presented) The speech recognition system of claim 1 , wherein said human 
interface device includes a graphical user interface on which the human operator views said text set 
of expected responses, wherein after listening to said audio response, the human operator is able to 
select one of said expected responses from said text set of expected responses if the human operator 
determines that the response corresponds to one of said expected responses. 

9. (Previously presented) The speech recognition system of claim 7, wherein said human 
interface device includes a graphical user interface on which the human operator views said text set 
of expected responses, wherein after listening to said audio response, the human operator is able to 
select one of said expected responses from said text set of expected responses. 

10. (Previously presented) The speech recognition system of claim 9 wherein said 
graphical user interface comprises an application navigation window for enabling the human 
operator to navigate through said text set of expected responses, and an audio navigation window for 
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enabling the human operator to control playback of said audio response. 

1 1 . (Previously presented) The speech recognition system of claim 8, wherein said 
graphical user interface comprises an application navigation window for enabling the human 
operator to navigate through said text set of expected responses, and an audio navigation window for 
enabling the human operator to control playback of said audio response. 

12. (Previously presented) The speech recognition system of claim 10, wherein said 
graphical user interface includes a text entry window which enables the human operator to enter a 
text response if none of said expected responses from said text set of expected responses corresponds 
to said audio response. 

13. (Previously presented) The speech recognition system of claim 9, wherein said 
graphical user interface includes a text entry window which enables the human operator to enter a 
text response if none of said expected responses from said text set of expected responses corresponds 
to said audio response. 

14. -25. (Cancelled) 

26. (Previously presented) A method of transcribing an audio response comprising: 

A. posing a query to a respondent over the telephone; 

B. receiving an audio response from said respondent over the telephone; 

C. performing a speech recognition function on said audio response to automatically convert 
said audio response to a textual response; 

D. recording said audio response; 

E. comparing said textual response to a set of expected responses to said query, said set 
including a plurality of expected responses to said query in a textual form; and 
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F. flagging said audio response so as to produce a flagged audio response for further review 
by a human operator if the corresponding textual response does not correspond to one of said 
expected responses in said set of expected responses within a predetermined accuracy confidence 
parameter, 

G. a human operator listening to the actual audio response corresponding to said flagged 
audio response; and 

H. a human operator determining if one of said expected responses corresponds to said 
actual audio response; and 

I. if such determination of step H. is in the affirmative, selecting, from said set of expected 
responses, a textual response that corresponds to said audio response. 

27. (Cancelled) 

28. (Previously presented) The method of claim 26, further comprising: 

J. manually transcribing a textual response that corresponds to said audio response if such 
determination of step H is negative. 

29. (Previously presented) A method of transcribing an audio response comprising: 

A. constructing a speech recognition application including a plurality of queries and a set of 
expected responses for each query, said set including a plurality of expected responses to each query 
in a textual form; 

B. posing each of said queries to a respondent over the telephone; 

C. receiving an audio response to each query from said respondent over the telephone; 

D. performing a speaker-independent speech recognition function on each said audio 
response to automatically convert each said audio response to a textual response to each query; 

E. recording and storing each audio response; 

F. automatically comparing each textual response to said set of expected responses for each 
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corresponding query to determine if each textual response corresponds to any of said expected 
responses in said set of expected responses for the corresponding query; 

G. flagging an audio response so as to produce a flagged audio response for further review 
by a human operator if the corresponding textual response does not correspond to one of said 
expected responses in said set of expected responses within a predetermined accuracy confidence 
parameter as determined by said speaker-independent speech recognition analysis, 

H. a human operator listening to the actual audio response corresponding to said flagged 
audio response; 

I. a human operator determining if one of said expected responses corresponds to said actual 
audio response; and 

J. if such determination of step I. is in the affirmative, the human operator selecting, from 
said set of expected responses, a textual response that corresponds to said audio response, and 
flagging each audio response that does not correspond to one of said expected responses in said set 
of expected responses to the corresponding query. 

30.-32. (Cancelled) 

33. (Original) The method of claim 29, further comprising manually transcribing a 
textual response that corresponds to each flagged audio response if such determination of step J is 
negative. 

34. -36 (Cancelled) 
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